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ABSTRACT 

Counter to intuition, the images of an extended galaxy lensed by a moving galaxy 
cluster should have slightly different spectra in any metric gravity theory. This is 
mainly for two reasons. One relies on the gravitational potential of a moving lens being 
time-dependent (the Moving Cluster Effect, MCE). The other is due to uneven mag¬ 
nification across the extended, rotating source (the Differential Magnification Effect, 
DME). The time delay between the images can also cause their redshifts to differ 
because of cosmological expansion. This Differential Expansion Effect is likely to be 
small. Using a simple model, we derive these effects from first principles. 

One application would be to the Bullet Cluster, whose large tangential velocity 
may be inconsistent with the A CDM paradigm. This velocity can be estimated with 
complicated hydrodynamic models. Uncertainties with such models can be avoided 
using the MCE. We argue that the MCE should be observable with ALMA. 

However, such measurements can be corrupted by the DME if typical spiral galax¬ 
ies are used as sources. Fortunately, we find that if detailed spectral line profiles were 
available, then the DME and MCE could be distinguished. It might also be feasible to 
calculate how much the DME should affect the mean redsliift of each image. Resolved 
observations of the source would be required to do this accurately. 

The DME is of order the source angular size divided by the Einstein radius times 
the redshift variation across the source. Thus, it mostly affects nearly edge-on spiral 
galaxies in certain orientations. This suggests that observers should reduce the DME 
by careful choice of target, a possibility we discuss in some detail. 
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1 INTRODUCTION 


The standard ACDM paradigm (Ostriker & Steinhardt 


1995) still faces many challenges in reproducing galaxy scale 


observations (for a recent review, see Famaey & McGaugli 


20121. Particularly problematic is the anisotropic distribu¬ 


tion of satellites around Local Group galaxies, a question re¬ 
cently revisited in detail (Pawl owski et al.|2014 |. A different 
analysis focusing on Andromeda came to similar conclusions 
( Ibata et al.|2014 |. The relevant observations for the Milky 
Way (Pawlowski & Kroupa 2013) and Andromeda (Ibata 
et al. [2013 1 are difficult to repeat outside the Local Group 
because of the need to obtain 3D positions and velocities. 

On a larger scale, [Cai et al.| ( |201 4) found that the colli¬ 
sion speed distribution of interacting galaxy clusters can be 
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quite sensitive to the underlying law of gravitation. Thus, 
the high collision speed of the components of the Bullet 
Cluster 1E0657-56 ( Tucker et al.||1995 l has been argued in 
favour of modified gravity (Katz et al. 20131. However, this 


speed is not directly measured as the collision is mostly in 
the plane of the sky. Instead, the speed is estimated using 
simulations of the shock generated in the gas by the collision. 
The separation of the DM and gas ( Clowe et al.||2006 ) also 
plays an important role - there is less gas drag at lower 
speeds, so the separation is generally reduced. 

A collision speed close to 3000 km/s is thought to be 
required to explain the observed properties of the Bullet 
Cluster (Mastropietro & Burkert 200 8|). For the inferred 
masses of the components (Clowe et al. 12004 1, this appears 


difficult to reconcile with ACDM (Thompson & Nagamine 


2012). This work suggested that a cosmological simulation 
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2 Indranil Banik & Hoiigsheng Zhao 


requires a co-moving volume of (4.48/i _1 Gpc) 3 to see an 
analogue to the Bullet Cluster. 

The recent work of Lage & Farrar (2014a) finds a similar 
collision speed but suggests a higher mass for the Bullet 
Cluster’s components. While higher mass objects are likely 
to collide faster, such heavy clusters are rare in cosmolog¬ 
ical simulations. For example, their own unpublished work 
(Lage & Farrar 2014bI based on the Horizon Run N-body 
simulation (Kim et al. 2009I showed how there were only 
7 cluster pairs with masses comparable to their higher esti¬ 
mate for the Bullet Cluster mass. Because a larger volume of 
(6.59/i _1 Gpc) 3 was used in the simulation and (f^|) ~ 3, 
this result is not very surprising in light of previous works. 

However, some recent unpublished studies have raised 
the probability estimate of observing a galaxy cluster merger 
wit h properties comparable to the Bullet Cluster. [Bouillot] 


et al. (20141 used a larger box size of (21/i _1 Gpc) 3 , using the 
DEUS-FUR simulation. Thompson et al. (|2014 | took issue 
with the Friends of Friends algorithm ( |Davis et al.|[l985| ) 
long used to search outputs of N-body simulations for ana¬ 
logues to the Bullet Cluster. After switching to the recently 
developed ROCKSTAR algorithm (Behroozi et al. 20131, 
the rate of occurrence of analogues to the Bullet Cluster 
increased by a factor of ~100. Despite this, |Thompson et al.| 
(20141 quoted the probability of a collision as fast as the 
observed one as only 1 in 2170, which is still fairly small. 

Moreover, a few other massive colliding clusters with 
high infall velocities have been discovered in the last few 
years (|G6mez et al.||2012| |Menanteau et al. ||2012| |Molnar| 
et al.|2013b| ). The El Gordo Cluster (ACT-CL J0102-4915) 


may be particularly problematic due to its combination of 
high redshift (z = 0.87, |Menanteau et al.|[2012[) , high mass 


(Jee et al. 20141 and high inferred collision speed (Molnar 
fc Broadhurst|2015 I. 


A detailed analysis of how likely it is that observers 
would have seen interacting clusters with the observed prop¬ 
erties is still lacking. One would need to account for in¬ 
complete sky coverage and perhaps faster collisions being 
easier to discover due to greater shock heating of the gas. A 
key input to any such analysis must be the collision speeds 
of the components. This work focuses on measuring cluster 
motions more accurately. 


Molnar et al. (2013b I argue that inferring collision 


speeds from observations of the shock can be non-trivial 
just due to projection effects, let alone other complexities 
of baryonic physics. To see if there is any tension with the 
KCDM model, the collision speeds should be determined in 
a more direct way. Ultimately, we would like to determine 
the proper motions of colliding clusters. 

Although not feasible by traditional methods, such 
motions may be inferred using the Moving Cluster Effect 
(MCE, Birkinshaw fe Gull||1983 l. As derived later, this ef¬ 
fect relies on the gravitational potential of an object being 
time-dependent due to its motion. Consequently, if a source 
behind the object were multiply imaged, the images would 
have slightly different redshifts. Moreover, as the dark mat¬ 
ter generally outweighs the gas on cluster scales (|Blaksley fc| 


Bonamente 2010), the MCE is mostly sensitive to the motion 


of the dark matter. This is simpler to model than the gas, 
making the results easier to compare with simulations. 

The MCE would likely be around 1 km/s for the Bullet 
Cluster (Molnar et al.||2013al. The Effect may be searched 


for using Cosmic Microwave Background (CMB) photons 
(e.g. Cai et aI7||2010 1. However, as noted by those authors, 
temperature anisotropies in the CMB make it difficult to 
spot such a signal around an individual object. Thus, we 
focus instead on using a multiply imaged background galaxy 
as the source. Spectral features in this galaxy could be used 
to determine the redshifts of its multiple images. 


We consider the feasibility of obtaining measurements of 
the required accuracy using ALMA in Section [ 7 ] Measuring 
this Effect seems to be within our reach. One might instead 
conduct the observations in the visible/near-IR with large 
spectroscopic instruments such as the TMT and E-ELT. 


Therefore, it is important to consider other effects that 
might also cause the redshifts of double images to differ. Per¬ 
haps the most important such mechanism is what we term 
the Differential Magnification Effect (DME). This depends 
on details of the source. If this is a rotating disk galaxy 
not viewed face-on, then different parts of the source have 
different radial velocities and hence redshifts. 


The lens magnifies the source non-uniformly. The exact 
way in which this occurs is different for each image. Conse¬ 
quently, the intensity-weighted mean redshift of the images 
is usually different. 

If one could perform Integral Field Spectroscopy of the 
source galaxy accurate to ~1 km/s, then one would simply 
need to compare the redshift of the same part of the galaxy 
between the two images. By focusing on a small part of 
the galaxy, the DME would be greatly reduced. However, 
this will be a challenging observational goal. The high spec¬ 
tral accuracy demanded by MCE measurements means the 
source will likely be spatially unresolved in the near future. 

Assuming this to be the case, we determine the order 
of magnitude of the DME for a typical disk galaxy. We find 
that it may well be significant. Thus, we explore exactly how 
it affects the profiles of individual spectral lines. The Effect 
is quite different to the MCE, which simply shifts each line. 
This may provide a way to correct for the DME and also 
to verify that a redshift difference is indeed caused by the 
MCE. Without spectra detailed enough to see such small 
differences between line profiles, it might still be possible to 
calculate the DME, though the determination would be less 
secure. 


The additional complications and uncertainty intro¬ 
duced by trying to correct for the DME necessitate a dis¬ 
cussion on how it may be reduced. Aside from the obvious 
steps of using ellipticals/face-on spirals and smaller — likely 
slower-rotating — galaxies, an important factor to consider 
is how much the magnification varies across the source. The 
larger the variation, the larger the DME. 


For this reason, an edge-on fast-rotating spiral galaxy 
might still be a good target if it is oriented so the magni¬ 
fication is nearly constant across the image. At the other 
extreme, the magnification varies rapidly near a caustic. 
Therefore, caustic images are likely to be strongly affected 
by the DME ( Molnar et al.(2013a l. 


We emphasise the need to model both the redshift struc¬ 
ture of the source and the deflection map of the lens when 
trying to use precise lensed image redshifts to determine the 
tangential motion of the lens. 
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Effects of Lens Motion and Uneven Magnification on Image Spectra 3 



Figure 1. The lensing geometry is depicted here. Upper photon 
trajectory = primary image (same side as unlensed source), lower 
trajectory = secondary image. Relevant distances are indicated at 
bottom. The lens is treated as a point mass moving transversely 
to the viewing direction. The source is an extended disk galaxy. 
There is a redshift gradient across it due to rotation. 


2 THE MOVING CLUSTER EFFECT IN 
STANDARD AND NON-STANDARD 
GRAVITY 

2.1 The Lensing Geometry 

Figure [T] illustrates the basic geometry that will be consid¬ 
ered here. Because we are mostly concerned with angles on 
the sky, the distances relevant to us are the angular diameter 
distances to the lens and source (D; and D s ). Also impor¬ 
tant is the angular diameter distance to the source galaxy 
as perceived by an astronomer at the lens, measured at the 
epoch that the lens is currently observed at ( zi ). This last 
distance we denote Di s . 


2.2 Including Lens Motion 

Suppose that the lens moves transversely to OL. Thus, 
one of the light paths gets ‘stretched’ while the other gets 
‘squeezed’, leading to a redshift difference between the im¬ 
ages. To calculate this effect, we make the thin-lens or trian¬ 
gle approximation whereby each photon trajectory is treated 
as two straight lines. In this case, the light arrival time sur¬ 
face is generally given by (Kovner[l990) 


cT(0) = constant + 0 - - <k(A6> - xi) (1) 

2 Du, 

This consists of a geometric part and a relativistic part 
due to the lensing potential >k. We assume that 'k depends 
only on position relative to the lens, which is located at xi 
relative to some reference point in the lens plane. 

The path length cT(0) can be thought of as a function 
of the lens plane position 0 hit by a ray from the source. 
The actual rays for the images are at the extrema of this 
function (Fermat’s Principle). 

We briefly describe how the geometric part of Equation 
[I] is derived. Each section of a hypothetical undeviated pho¬ 
ton trajectory can be mapped onto a part of the actual tra¬ 
jectory. The latter is slightly longer as there is an extra factor 
of the secant of the angle between them. This is expanded at 
second order, as the angle is small. The angle is (9 — /3) for 
the stretch LO while for OS, it is (9 — /3) ff- . The last 
factor arises because photons emitted in the same direction 
gradually get further apart due to cosmic expansion. Thus, 
photons emitted in different directions end up more widely 
separated than in a static Universe. 

Equation |T] follows most naturally if using co-moving 
distances, which can be added simply. This fact leads to the 


very useful relation between angular diameter distances 
(1 + zi) Di + (1 + z s ) Di s = (1 + z s ) D s (2) 

If the lens moves in the transverse direction, then xi 
changes and so the lensing potential T changes at every 
point in the lens plane. Thus, transverse motion of the lens 
would cause the arrival time to change according to 


cT = x j ■ V\k = —vt ■ a 


(3) 


a = — Vfk is the unreduced (true) deflection angle and 
v t = xi is the transverse velocity of the lens (note time here 
refers to that measured by a clock at the lens). 

The rate of increase of the path length cT is equivalent 
to a shift of the intrinsic spectrum of the source. While the 
source’s intrinsic spectrum can’t be directly measured, the 
relative spectra of the images in a multiple-image system 
can. Images 1 and 2 would have a relative redshift velocity 


SV r = Vi - Vi = — v t ■ (ai — C*2) 


(4) 


where cti — c *2 is the relative deflection angle between 
the images. The observed angular separation between the 
images is ff times as much. This allows the MCE to be 
calculated without knowing what the deflection angles are, 
as long as one is sure of the identification of the double image 
and the distances to the lens and source. 

So far, we have used time measured by a clock at the 
lens. Using one on Earth instead would introduce a factor of 
(1 + 2 ;) to the time delay. Putting it in, we should think of vt 
as the transverse peculiar velocity in co-moving coordinates. 
This is the co-moving lens distance times the relative proper 
motion of the lens with respect to that of the source^] This 
takes account of transverse motions of the observer and the 
source. In Section |3.2| we show that such motions affect 
image redshifts much less than motion of the lens. 

For multiple lens planes, one would simply add the red¬ 
shift differences due to each plane. 

The above derivation is a property of metric theories 
of gravity. Hence, it is independent of details of the theory, 
something we now show. Consider a static Universe with 
no observer-source relative motion. Use a reference frame 
moving with the lens, so the source and observer both appear 
to move at —xi. In general, there is a Doppler shift for a 
photon emitted by the source as perceived at the lens. A 
similar effect arises between lens and observer. The shifts 
cancel if the photon is not deflected by the lens (if the source 
emits a photon ‘backward’, then the observer ‘ploughs into’ 
the photon). 

However, if the photon is deflected, then there is a net 
frequency shift between source and observer. This shift is 
different for another photon which gets deflected by a differ¬ 
ent amount. Therefore, the difference in deflection angles 
determines the redshift difference between the photons. If 
observers could be sure the photons initially had the same 
energy, then the redshift difference could be directly mea¬ 
sured, thus constraining xi. 

For small deflections by a non-relativistic lens, the result 
in an expanding Universe is exactly the same as in a static 
one, once all the angles have been properly accounted for. 

1 Physically, it would be the peculiar velocity of the lens in the 
direction orthogonal to our line of sight, in the absence of peculiar 
motions of either observer or source. 
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2.3 Point mass lenses 


The unlensed source and images all lie along a line, so we 
only consider positions along this line. For a point mass 
lens, one can generically say that a ray of light with impact 
parameter Di9 is deflected by 


4 GM 
c 2 D:9 


(5) 


3 CAUSES OF DIFFERENCES BETWEEN 
MULTIPLE IMAGE REDSHIFTS 

Before deriving the DME, we briefly consider a few factors 
that can affect redshift differences between double images of 
a source strongly lensed by the Bullet Cluster. 


3.1 Lens Motion 


M is the ‘equivalent lensing mass’ in alternative gravity. 
Different gravity theories require different amount of real 
mass M to produce the observed equivalent mass M, the 
Einstein radius 9e or the deflection angles ai, 2 - In general, 
M depends on position in modified gravity theories, even 
with a point mass. For simplicity, we neglect this. 

Combining Equations [4] and [5j we get that 


_ 4 GMv t f 1 

“ c 2 Di ^ 



( 6 ) 


Note that the signs of 9 1 and 02 are opposite because 
the images are on opposite sides of the lens (Equation |10[ ). 

Vt is the component of the transverse velocity vt pro¬ 
jected along the line connecting images 1 and 2. If the lens 
proper motion is orthogonal to the image separation, then 
Vt would be zero. 

Noting that deflecting a photon at the lens only affects 
part of its trajectory, we get the classical lens equation 


The MCE is maximal for a source displaced from the lens 
in the direction of its proper motion. This direction can be 
determined from images of the shock fairly easily. Moreover, 
observers should select targets to maximise the MCE. Thus, 
we assume the double images are indeed separated along 
the direction of motion of the lens. Otherwise, the MCE is 
reduced by the cosine of the angle between them (Equation 


0' 

With these assumptions, we combine Equations Ril and 
|10| to get that the difference in redshift velocity between the 
two images is 




2 vtyjGM (u 2 + 4) r~D. 


DisDi 


(13) 


Using parameters appropriate to the Bullet Cluster (Ta¬ 
ble [lj), the effect is around 1 km/s. 


3.2 Source & Observer Motion 


P 


D ls AGM 
D 3 c 2 Di9 

9 — —where 9e = 
6 


AGM Du 
c 2 DiD : 


(7) 

( 8 ) 


The Einstein radius 9e defines a typical angular scale 
for the problem. It will be convenient to use this to normalise 
all relevant angles. Thus, we let 



and 



Images are formed where 
V = \ ( W ± ^ U2 + 4 ) 


(9) 


( 10 ) 


For later use, we note that the magnification of a small 
part of the source located at an unlensed angular position 
of /3 = u9e relative to the lens is given by 


A 


0 09 

pop 


1 

2 


u 2 + 2 
u\Ju 2 + 4 



( 11 ) 

( 12 ) 


The result follows from the surface brightness of an un¬ 
lensed source and a lensed one being equal (Liouville’s Theo¬ 
rem). Thus, A is the Jacobian of the mapping between where 
objects appear on the sky and where they would without a 
lens. The modulus signs are needed because otherwise A < 0 
for the secondary image (indicating that it is inverted). 

The secondary image becomes very faint if u 1 (in 
this case, A ~ -%). It is difficult to find a source with u <C 1 
as this corresponds to a very small part of the sky. Thus, 
any source used for MCE measurements will very likely have 
u ~1. We assume this is the case. 


The peculiar velocity (w.r.t. the CMB) of the Sun is well- 
known (369 km/s, Planck Collaboration |2014b) and could 
be included in a more careful analysis. We do not include it 
as we only seek a rough idea of the magnitude of the MCE. 
This won’t be substantially affected by observer motion as 
this is much slower than that of the lens (~3000km/s). 

More problematic may be the unknown peculiar veloc¬ 
ity of the source. Treating the Local Group peculiar veloc¬ 
ity (~630 km/s) as typical for galaxies, the Bullet Cluster 
transverse motion likely greatly exceeds the source’s peculiar 
velocity. In this case, only the component of this velocity 
parallel to the lens transverse motion has much effect on 
image redshifts, leading to a factor of \ on average^ 

Another factor of jxJ- Jy- ~ | arises due to the geome¬ 
try of the situation and cosmic expansion. Moreover, typical 
peculiar velocities were smaller long ago. Supposing they 
were | as much as today at z s = 1.5, we see that source 
motion can’t affect the inferred lens velocity much more 
than ~50 km/s. This effect can be reduced by observing 
more than one double image pair. However, we consider the 
accuracy with even just one well-observed pair sufficient. 

Thus, we ignore any motion other than that of the lens. 
We note that it might be good to avoid source galaxies which 
are interacting, as their peculiar velocities might be higher. 


3.3 Cosmological Expansion 

A redshift difference between the images can also arise be¬ 
cause the time of flight of photons emitted by the source is 
different depending on which path they took to get to Earth. 


1 For an angle between a fixed vector and another statistically 
isotropic one, (| cos 0 \) = /. 
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As the photons for both images arrive simultaneously, the 
photons for one image must have been emitted earlier than 
for the other. Thus, in an expanding Universe, one of the 
images will have a higher redshift. Due to both a longer path 
length and a stronger gravitational field along the path, this 
image is the secondary (on the opposite side of the lens as 
the unlensed source would appear — see Figure [l]). We term 
this the Differential Expansion Effect (DEE). 

A quick way to estimate the DEE is to assume that cos¬ 
mological distances like Dl are usually . The extra path 
length ~Dn# 2 . The effect of the difference in gravitational 
time delays can crudely be approximated as equal to that 
due to different geometric path lengths. 

The DEE expressed as a redshift = HAt « d 2 . Mean¬ 
while, the MCE Assuming a velocity of 3000 km/s 

and an image separation of 20”, we see that the MCE is 
~50 times larger than the DEE. Thus, we calculate the DEE 
more precisely. 

We first consider just the difference in time of flight 
(‘delay’) due to different strengths of gravity along the two 
possible photon paths (Shapiro] 1964). The relative Shapiro 
delay between the images is 


At, = ^-Ln (£ 
c 3 \b i 


D„, Dis 6 i : 2 


(14) 


The impact parameters of the rays are 61 , 2 - This result 
is valid if b is much larger than the Schwarzschild radius of 
the lens, so the rays are only weakly deflected. The ray with 
smaller b is delayed more as it goes deeper into the lens’ 
gravitational potential well. It also has a longer geometric 
path length (it forms the secondary image in Figure [lj. 

Note this is the time delay at the lens. In reality, both 
photons must reach Earth now, so the time of emission at 
the source must have been different. This requires an extra 
factor of the relative rates of a clock at the lens and at the 
source, jEr- 

We then combined this Shapiro delay with the geomet¬ 
ric path difference between the trajectories. Thus, the dif¬ 
ference in time of emission required for photons traversing 
the two trajectories to reach Earth simultaneously is 


A t a = 


1 + zi 2GM 
1 + z s c 3 


u\/u 2 + 4 + 2 Ln 


^ u 2 + ufiu 2 + 4 + 2 ^ 


(15) 


We used Equation [To] to relate the source position u 
to the positions of its images. A correction for cosmological 
time dilation was also applied. 

The change in redshift is given by the fractional dif¬ 
ference in the scale factor of the Universe at the time of 
emission of the photons. 

Az — H(zs)At s (16) 

Using realistic parameters (Table [I] and u ~ 1), the 
DEE ~1 m/s. In Section [3.1| we showed that the MCE is 
~1000 times larger, allowing us to neglect the DEE. 

Even with more accurate instruments, a very large num¬ 
ber of double image pairs would need to be observed before 
the random noise from source peculiar motions was reduced 
below such a small level. Thus, the DEE won’t be important 
in the foreseeable future. An exception might possibly arise if 



Figure 2. The observing geometry is shown here. The source 
galaxy has centre O and normal to its plane ON. Earth is towards 
OE, so the galaxy’s inclination to the sky plane is i. OQ and OP 
are in the galaxy’s plane and orthogonal to each other, with OQ 
as closely aligned with OE as possible. Thus, OP and OE are 
orthogonal. VA is directed within the source plane, so must also 
be orthogonal to OE. VA is at an angle 7 to OP. The source 
is parameterised using cylindrical polar co-ordinates (r, 0). with 
centre O and initial direction (0 = 0) along OQ. 


the source galaxy peculiar motion could be estimated based 
on properties of galaxies near it. 


4 DERIVATION OF THE DIFFERENTIAL 
MAGNIFICATION EFFECT FOR 
UNRESOLVED IMAGES 


The effects mentioned in Section [3] cause the frequencies of 
identical photons emitted in different directions to end up 
different when measured at the Earth. The DME does not 
do this. It is merely an observational artefact due to inability 
to simultaneously resolve the images and take highly accu¬ 
rate spectra of them. This causes parts of the source with 
different redshifts to get blended together in the spectra. 
The precise way in which this blending occurs is different 
between the images. 

We assume the spectra are integrated over the entirety 
of each image. The source is modelled as a typical spiral 
galaxy with exponential surface density profile and a real¬ 
istic rotation curve. The lens is treated as a point mass. 
The parameters considered (Table PH) are designed with the 
Bullet Cluster (Tucker et al. 1 995 ) in mind. 

The basic idea is that spatially unresolved spectra can 
determine the intensity-weighted mean redshift of each im¬ 
age. This may be affected by rotation of the source galaxy. 
The effect isn’t reliant on an expanding Universe, so it will be 
simplest to think of the Universe as static for the remainder 
of this section. 

The mean redshift velocity of each image is given by 


f T AEv r dS 

- J Image 

V r = 


h 


Image 


AE dS 


(17) 


The integrals are over area elements of the source S. 
This is treated as a disk with surface density 


E = Eq e rd 


(18) 


The magnification A varies little over the source galaxy. 
This is because (see Table |TJ) . Thus, a linear ap¬ 

proximation to A is sufficient. 

dA 

A « Aq + -^—du (Ao = A at centre of source) (19) 
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In our model, A varies linearly with position in the 
source plane, but only in the direction directly away from 
the (projection of the) lens. In the orthogonal direction, A 
is independent of position at first order (because u is, and 
A depends only on u ). 

The geometry of the source is shown in Figure [2] The 
radial velocity of any part of it is 

v r = v c (r) sin 0 sini ( 20 ) 


Only the component of Vi along OP is important. To 
see why, suppose that Vi was entirely along OQ. Reflecting 
the galaxy about the line OP without altering V.4 should 
reverse the DME as this is equivalent to reversing Vi. How¬ 
ever, the radial velocity of every part of the galaxy remains 
unaltered after the reflection (as <f> —> tv — $). Thus, the DME 
must also remain unaltered. 

Noting that the component of Vi along ON is irrele¬ 
vant for the DME, we see that only the component along OP 
might be relevant. This component causes the approaching 
and receding halves of the galaxy to be magnified differently. 
It will be responsible for the DME. Thus, we assume the 
lens is located somewhere along the line OP, making Vi 
entirely along this direction. The result is then multiplied 
by cos 7 . 

The magnitude of the DME is therefore oc cos 7 sini. 
Assuming isotropy, all values of 7 are equally likely. But 
values of i close to ^ are more likely because there are more 
ways for two vectors to be orthogonal than to be aligned. 
This means the ratio between the average magnitude of the 
DME and the maximum it could be is given by the mean of 
| cos 7 sini|, with 7 unweighted but a further sini weighting 
over iJ^Thus, 


(| sini COS 7 I) 


2 

J cos 7 d'y 
0 


f d 1 

0 


X 


7r 

f sin 2 % di 
0 

7T 

f sini di 
0 


1 

2 


( 21 ) 


The angular separation between the lens and the un- 
lensed source is given by 


u = u 0 + 


r.OP 

D s 8e 


(u 0 = u at centre of source) 


( 22 ) 


The component of r (measured from the galaxy’s cen¬ 
tre) along OP is rsincf). We have not kept careful track of 
signs because, for any conceivable orientation, the source 
galaxy could be rotating in the opposite sense, thereby re¬ 
versing the DME. We explain which image has a lower red- 
shift due to the DME later. 

The difference in u between the centre of the source 
galaxy and any other point in it is given by 


du 


r sincj) 
D s 6e 


(23) 


Aq represents a constant magnification across the 


source. This does not contribute to the numerator in Equa¬ 
tion [Tt] because the radial velocity v r oc sin rf), while E is 
independent of (j> due to axisymmetry. Thus, integrating over 
(j> gives 0 ( 2 ] The DME arises when including the first order 
correction to A. 

The denominator in Equation [IT] is a normalisation for 
each image (its total intensity) [^] Because the magnification 
is nearly constant across the source galaxy, we can approx¬ 
imate that A = Ao- The first order correction to A would 
have a sin cj> dependence, which is irrelevant when integrated 
over all <f>. This further justifies our approximation. There¬ 
fore, the denominator in Equation |17| becomes 


L 


Image 


AT, dS = So7rr,j 2 


u 2 + 2 
u\fv? + 4 



(24) 


To understand the sign of the DME, firstly note that 
regions closer to the lens are magnified more. In our ap¬ 
proximation, the numerator in Equation [lT] is determ ined 
by which is the same for both images (Equation 12 1 . 
Therefore, the image with the lower magnification (the sec¬ 
ondary image) has a larger |tv|- Thus, if it was known which 
side of the rotating source was the approaching side, one 
could determine which image should have a higher mean 
redshift due solely to the DME. 

Including the second order dependence of A on sky po¬ 
sition slightly alters the calculations done so far. Because a 
second order term does not affect the approaching and reced¬ 
ing halves of the source galaxy differently, the numerator in 
Equation [TT] is unaltered. But the denominator is affected, 
because the total intensity of each image may be altered 
by a second order term. This means that our derivation of 
the DME has a fractional error which is second order in 
D r g . We consider this acceptable & proceed to develop a 
model for the redshift structure of the source. This requires 
a rotation curve. 


4.1 Model Rotation Curves 


It will likely be difficult to directly observe the source galaxy 
rotation curve v c {r) as it is very far away. It is also difficult 
to precisely determine its surface density and thus predict 
the form of v c (r). Fortunately, we are considering a disk- 
integrated spectrum and so the exact shape of v c (r) will 
turn out not to be very important once the maximum level 
Vmax is fixed. 

To get a rough idea of v c (r), we take advantage of the 
tight empirical relation between the forces in rotating disk 
galaxies as required to sustain their rotation curves and 
those predicted by Newtonian gravity based on the visible 
(baryonic) mass (Famaey & McGaugh 2012 and references 
therein). This empirical formalism goes by the name of Mod¬ 
ified Newtonian Dynamics (Milg rom||1983[ ). Regardless of 
whether it is correct at a fundamental level, it does seem to 
provide a good empirical way of predicting rotation curves. 
Here, this is important because measuring the actual rota¬ 
tion curve of the source galaxy would be very challenging. 


1 Edge-on galaxies are less likely to be detected due to dust ob¬ 
scuration. This makes low values of i - and thus a smaller DME 
- more likely, for a randomly selected multiple image. 


2 This is expected, as the DME doesn’t arise if the image is 
uniformly magnified. 

3 What we perceive as the total intensity given D s and z s , but 
without correcting for magnification by the lens. 
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Th e particular empirical rela tion we adopt follows the 
work of Famaey & Binney (20051 ^ 


Iff! + oo 


9 n 


(25) 


g is the true gravitational field strength while gjv is the 
prediction of Newtonian gravity based on the visible mass, 
ao is an acceleration scale (« 1.2 x 10 _ 10 m/s 2 ) below which 
either gravity becomes non—Newtonian or dark matter must 
be considered in addition to the baryons. Thus, the magni¬ 
tude of the gravitational field is given by 


9 N , I(9n\ 2 

9 = T + \I\^J +9Nao 


(26) 


It is not worthwhile to accurately determine <?jv for any 
particular mass distribution because the actual mass distri¬ 
bution in the source is uncertain. Thus, we approximated gjv 
using an analytic method. We assumed that, to determine 
gN at a particular in-plane location, only material at smaller 
radii need be considered (we verified that the force from ma¬ 
terial at larger radii was very small). The Newtonian force 
at a distance r from the centre of a narrow ring of mass dM 
and radius x is 


G dM 3 G dM x 2 
9N « —pr~ + —— 


(.x < r, interior ring) (27) 


This is correct at second order in The total force 

r 

at any point within the disk was found by decomposing the 
galaxy into a large number of rings with dM = 2nx dx T,(x). 
We then summed only the forces resulting from interior 
rings. Therefore, 


9n 


r 


fir) 


r ( G dM 3 GdMxf\ 

Jo V r2 + 4r4 / 

27rGEo/(r) where 

— and 
Vd 

1 - 7e -fr 9(1- e“ F - ¥e~ 7 ) 

r 2 4r 2r 4 


(28) 

(29) 

(30) 

(31) 


When obtaining the true value of g from gjv, the ratio 
is important. Therefore, we introduce a new variable, 
the dimensionless density k. 

k = (32) 

ao 

Typical values for k are order 1. Using the empirical 
Equation [26] to get g from gN, 




Figure 3. Top: Rotation curves resulting from Equations |31| fe 
|36| used in this work. v c (r) flatlines at Vf. The surface density 

E = Eoe r d . The parameter k controls the shape of the rotation 

curve (k = Sf)G ). Bottom : The ratio of maximum to flatline 
v a o ' 

rotation speed as a function of central surface density. 


rotation curve is given by 


v c (r) 


Vcjr ) 

Vf 


■\J nkrf(r) + r\J ( nkf(r) + l) 2 —~f 


\Z2nk 


(36) 


4.2 The Final Result 

Combining our results, we get that 


— = rkf{r) + J(nkf(¥) + 1) 2 -1 (33) 

ao v 

To get the rotation curve, we equate g with the cen¬ 
tripetal acceleration. Thus, 


v c 

r r d 

Vf = f/2nky/r d ao 


= W ao 


(34) 

(35) 


The rotation curve flatlines at the level Vf = fiGMao, 
where the total disk mass M = 27tEq r 2 . The shape of the 


1 This is the so-called ‘simple g- function’ in MOND. 


VfTd smi cos 7 o o 


ocS 

oo 

/ / e_r vi(r)r 


BA 

Ou 


u 2 (u 2 + 4 ) 2 


- sin 2 (j> d(p dr 


D s 9e 


u 2 + 2 
Mu 2 + 4 


± 1 


(37) 


The magnification A changes by order 1 over an angu¬ 
lar distance of 8e, while the angular radius of the source 
galaxy Thus, the DME as a fraction of the typical 

radial velocity of the source is ~ g r g . The galaxy’s typical 
radial velocity is Vf sin i. Another factor of cos( 7 ) is needed 
to account for the lensing geometry. As can be seen from 
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Equation 37 this provides a rough guide to the DME (as 
u ~1 for a realistic target). 

An important quantity for the DME is the difference in 
-jj between the images. 


_ dA 

A du ) du 


\Ju 2 A 4 


(38) 


In Equation |37| the integration over </> yields n. The 
integral over r is not analytic. Thus, we define 

poo 

I = / e~ r v c (r) r 2 dr (39) 

Jo 

Substituting for 9e using Equation [5] we get that 

(40) 


Av r 


Vf rd sin i cos 7 I c \fDi 
VGMDuDs 


The integral I depends somewhat on the central surface 
density in the sense that, for the same Vf, the DME is greater 
at higher k. However, the maximum rotation speed is very 
well correlated with the DME. In fact, the ratio - ( — = 

’ "max 

1.89 ± 0.02 for k = 0.1 —> 5. As maximum rotation speeds 
are generally easier to determine than the flatline level, this 
makes correcting for the DME easier. 

If the surface density declines sufficiently slowly with 
r, then the integral / might diverge. This is due to limited 
validity of a linear approximation to A — a more careful 
treatment would be required. This might apply to rotating 
elliptical galaxies with p oc r~ 4 . But even if E oc r -3 , the 
divergence of I is fairly slow. Thus, although the integral 
would need a cut-off radius, its precise value would not affect 
the result much. 

A linear approximation to A must break down if u 
changes by order 1. Thus, a logical cut-off might be the 
Einstein radius (at the source plane) or the distance between 
the source and the projected lens. 

If a fibre-fed spectrograph was used or the field of view 
was otherwise restricted, then this may impose an obvious 
cut-off. For an inclined disk galaxy, a circular aperture would 
cover a non-circular region in the disk plane. One might need 
to take this into account. 

We now decide on realistic parameters to gain a feel for 
the scale of the DME & MCE. Ideally, one would like to 
measure the motion of both components of the Bullet Clus¬ 
ter. However, we choose a mass corresponding roughly with 
the sub-cluster in the Bullet (Mastro pietro fc Burkert|2008 1 . 
This is because the centre of mass likely has little peculiar 
velocity as there is little structure on such large scales. Thus, 
the lower mass component probably moves faster (w.r.t the 
Hubble flow). 


The MCE oc v\fM (Equation 131. Assuming also that 


v oc jj for the components of the Bullet Cluster and that the 
value of u would be broadly similar whichever component 
is targeted, the MCE overall oc This makes it larger 
around the sub-cluster. Furthermore, using its motion to 
extrapolate the total collision velocity is much more reliable 
than using the motion of the main cluster, because the sub¬ 
cluster contributes most of the relative velocity. 

A typical source galaxy orientation is chosen using 




Figure 4. The difference in redshift between double images of 
a typical background galaxy as a function of its position, due to 
the effects described in the text (Equations |13| and |40| ). Parameter 
values used are listed in Table ^ The shape of the rotation curve 
(/.:) has a modest impact on the DME once its flatline level vj is 
fixed. If instead v max is held fixed, then the impact of k on the 
DME is very small. 


Table 1. Input parameters used for Figure^] The source galaxy 
is assumed positioned so as to maximise the MCE (i.e. it is sep¬ 
arated from the lens on the sky along the direction of motion 
of the lens, which is clear from images). The lens mass should 
roughly correspond to the sub-cluster in the Bullet. A flat A CDM 
cosmology is adopted (Planck Collaboration 2014a). 


Parameter 

Meaning 

Value 

H 0 

Present Hubble constant 

67.3 km/s/Mpc 


Present matter density 

0.315 

Di 

(Angular diameter) 
distance to lens at zi = 0.296 

0.945 Gpc 

D a 

Distance to source at z s = 1.7 

1.795 Gpc 

Dis 

Distance to source from lens 
position in spacetime 

1.341 Gpc 

M 

Mass of lens 

1.2 x 1O 14 M 0 

rd 

Scale length of source galaxy 

3.068 kpc 

Vt 

Tangential velocity of lens 

3000 km/s 

v f 

Flatline level of source galaxy 
rotation curve 

100 km/s 

sin i cos 7 

See Figure |2| and Equation |2l| 

1 

2 


results shown in Figure [4] The DME and MCE are compa¬ 
rable in magnitude. 

Observing a similar source multiply imaged by the 
higher mass component instead does not reduce the relative 
importance of the DME. This is because the DME oc 
(Equation |40[ ), just like the MCE. In fact, this scaling high¬ 
lights an additional problem: substructure in the lens (e.g. 
individual galaxies) with much less mass than the entire 
cluster can enhance the DME. For example, an elliptical 
galaxy in the lens plane with M = 10 13 M 0 would cause a 
DME ~3 times larger than the smooth cluster potential. 

This problem could be mitigated to some extent by not 
selecting images which show indications of being lensed by 
small scale structure (e.g. avoiding images appearing near a 
galaxy in the lens plane). We have implicitly assumed that 
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such a selection has been done, such that a point mass model 
for the lens is appropriate. Even in this case, it might well be 
necessary to correct for the Differential Magnification Effect. 
This correction would be less relevant if targets could be 
selected for which the Effect is small. We now consider how 
these things might be achieved. 


5 CORRECTING FOR THE EFFECT 

For spatially unresolved spectra, it is possible to calculate 
the DME by determining the parameters in Equation |40| If 
radial velocities accurate to a few km/s are obtained for a 
galaxy, then determining v max sin* to ~10 km/s should be 
feasible using widths of spectral lines (see later). 

rd might be obtained from an image of the target, once 
distortion and magnification by the lens were corrected for. 
If the image were taken at more than one wavelength, it 
would suggest a value for k (which we don’t need very ac¬ 
curately) as the colour can be used to estimate the baryonic 
M/L. 

There is no need to determine the inclination as we 
are only interested in redshift gradients across the source. 
However, the orientation of the major axis of the image is 
important in determining the axis of rotation and thus the 
angle 7 in Figure [2] 

To know the sense of rotation (i.e. which side of the 
source galaxy is the approaching side), we would need spec¬ 
tra of different parts of the source galaxy. Naturally, a disk- 
integrated spectrum would be insufficient for this purpose. 
However, one could make do with poorer spectral resolution. 

The secondary image is inverted relative to the primary, 
providing an important consistency check if both images 
were used for such a determination. We strongly recommend 
doing this, because an error would lead to a 200% error in 
the calculation of the DME. The chance of this is minimised 
with two determinations of the sense of rotation. 

Finally, we also need VA, which must come from a 
lensing reconstruction. 

5.1 Additional Information From Detailed 
Spectral Line Profiles 

It is often possible to extract more information from a spec¬ 
tral line than just the location of its centroid. The width of 
the line profile can be used to estimate e.g. Vmax sin*. 

The MCE simply shifts the entire spectrum. The DME 
leads to a ‘tilt’ being introduced because one side of the 
galaxy is magnified more than the other. These effects are 
different. Therefore, detailed line profiles can tell us if the 
shift in the centroid of spectral lines is due to the MCE or the 
DME. This would avoid the need to determine parameters 
like the disk scale length and orientation. A detailed lensing 
reconstruction to determine VA would also be avoided. 

We investigate how the DME and MCE affect line pro¬ 
files of disk galaxies with rotation curves parametrised by 
Equation [36] We assume the galaxy is viewed edge-on, so 
the radial velocity of any part of it is 

v r {r, (/)) = v c [r) sin(0) (41) 

The resulting radial velocity map is shown in Figure [5] 
Only half of the galaxy is shown because v r is antisymmetric 


V r 



Figure 5. Radial velocity map of a disk galaxy viewed by an 
observer within its plane at large x, for the case k = 2.5. Ra¬ 
dial velocities are antisymmetric about the x-axis. The radial co¬ 
ordinate is rescaled so all parts of the figure would be equally 
bright. The units are such that = 1 and V f = 1. Note the large 
region with v r close to the maximum value. The result for k = 0.5 
is very similar, although v max is much closer to Vf. 



Figure 6. The synthetic line profile of an intrinsically narrow line 
in an unlensed galaxy with k = 2.5, viewed edge-on. The profile 
is symmetric about v r = 0. Velocities are scaled to the flatline 
level Vf. The sharp drop in the line profile (blue) would probably 
get blurred (e.g. by random motions), so we convolved the profile 
with Gaussians of widths a. The results are shown as red lines 
with thicknesses oc a. Notice how all 4 profiles pass close to the 
point marked B. The result for k = 0.5 is similar, if the profiles 
are scaled to have the same Vmax rather than Vf. 

about the viewing direction (the a;-axis). v r is symmetric 
about the y-axis, because sin</ = sin (-7T — (j >). 

To determine the profile of a narrow spectral line, we 
divide the galaxy up into a large number of elements. We 
use cylindrical polars so v r becomes separable. Thus, the 
rotation speed only needs to be calculated once at each r 
(for all </>). The radial velocity at the centre of each element 
is used to classify it among 200 bins in radial velocity. 

Assuming constant M/L for the baryons, the total in¬ 
tensity of the element multiplied by the magnification A is 
then assigned to the corresponding radial velocity bin. Be¬ 
cause radial velocities and wavelengths are directly related, 
in this way one obtains a synthetic line profile. 

Spectral lines have an intrinsic width and can be fur¬ 
ther broadened by random motions within the galaxy. To 
account for these effects and also for instrumental errors, 
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Figure 7. The residuals in the spectral profile due to the DME 
(Equation |43| l and the MCE (horizontal shift of profile) are shown 
here. These were obtained by subtracting a control line profile 
(Equation | i 2 ( . The patterns are antisymmetric about v T = 0. 
Results are for an edge-on galaxy with fc = 0.5 (top) and k = 2.5 
(bottom). Both effects change the mean redshift by 1% of the 
maximum rotation speed, representing 1.08% of Vf for k = 0.5 
and 1.26% for k = 2.5. The spectra were convolved with Gaus- 
sians of widths 0.05, 0.1 and 0.2 Vf (higher a indicated by thicker 
line). The MCE can’t change the amplitudes of the horns. The 
DME makes one more pronounced and the other less. 


we convolved our synthetic line profiles with Gaussians of 
various widths a. The results are shown in Figure [6] The 
sharp peaks at v r ~ ±v marc sin i give rise to the name of a 
double-horned profile. 

These horns are caused by the rotation curve having 
a peak, leading to a small range in v r corresponding to a 
large range in r. The greatest attained values of |tv| also 
correspond to large ranges in tj), because siruf is nearly in¬ 
dependent of <j> when 0 ~ ± ^. 

Thus, a small range in v T corresponds to a large region 
in the galaxy. Moreover, the peak rotation speed occurs at 
a radius close to that which maximises the light emitted 
per unit radius (r = r^). Figure [5] shows the ‘bull’s-eye’ 
corresponding to the fairly large region with near-maximal 
|v r |. This is responsible for the very pronounced horns in the 
line profile. They are somewhat less pronounced at high a. 

Although one might expect a feature corresponding to 
Vf (at least at low cr), this is absent. A quick look at Figure 


[5] shows why: v c ( r ) m Vf only for sufficiently large r. There 
is very little light from such regions, so a disk-integrated 
spectrum is hardly sensitive to them. In fact, due to the steep 
decline in surface brightness with r, most of the spectral 
intensity at v r = Vf actually comes from the rising part of 
the rotation curve (when v c sin <f> = Vf) rather than from the 
flat part. Thus, in the line profile, there is nothing special 
about Vf. This is not true for v max . 

Determining v m ax sin i from a line profile is non-trivial 
as the horns move to lower |rv| as a increases. Instead of 
using the horn positions, one could use the values of v r where 
the intensity is a certain fraction of the intensity at the line 
centre ( v r = 0). If this fraction is chosen carefully, then one 
could simply scale the resulting v r by a constant factor and 
accurately recover Vmax sin i over a wide range in a and k. 
To see why, note that spectra with different a all pass close 
to the point marked B in Figure [6] 

Ultimately, it might be better to compare the observed 
line profile with a suite of synthetic profiles built for a range 
of fc, a and VmaxSini. The initial guess for a might come 
from considering the shape of the tail. If Vmax sin i is accu¬ 
rately recovered, then the DME hardly depends on k. 0 

The horns are caused by a relatively small part of the 
galaxy but they greatly affect the mean radial velocity of 
its image. Thus, if the galaxy was not axisymmetric and 
e.g. had a dusty spiral arm obscuring light from this region, 
then the redshift measurement of each image may be biased. 
Partly for this reason, it may be a good idea to consider the 
rest of the line profile and not just the mean redshift (which 
is basically the same as considering just the horns). 

We now consider how the DME and MCE affect the line 
profile. The mean redshift velocity of the line is raised by 1% 
of v max (1.08% of Vf for k = 0.5 and 1.26% for k = 2.5). 
We consider separately the cases where either the DME or 
the MCE is wholly responsible for this shift in line centroid. 
We also construct control line profiles like those in Figure [6] 
These are obtained by setting 

A = 1 V r , 0 (42) 

In FigureJT] we show the pattern of residuals (relative to 
the control) created by each effect. The total line intensity 
is kept the same for the comparison. 

To obtain the corresponding observations, one would 
need to account for the images having different overall mag¬ 
nifications. Thus, the spectra would have to be rescaled. We 
assume this can be done perfectly (i.e. the photometry is 
very accurate). 

The MCE corresponds to a horizontal shift in the spec¬ 
trum relative to the control. This means the amplitudes 
of the horns are unaffected. The pattern of residuals cor¬ 
responds to the gradient in the spectrum. Thus, the resid¬ 
uals are largest near the positions of the horns, but small 
precisely at them. The residuals are of opposite signs on 
either side of each horn because the gradient in the spectrum 
changes sign there. 


1 Line profiles can also be used to find v f sin i without detailed 
rotation curves. In this case, the value of k is important as a 
‘E—correction’ must be applied to get from v m ax to Vf (e.g. 
bottom panel in Figure [3j . k doesn’t affect the line profile much 
and so it would need to come from an image & photometry. 
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For the DME, we set 

A=l + rirsm(f> (43) 

Note that no assumptions are made about any of the 
factors controlling the amplitude of the DME, beyond it 
being a small effect relative to VmaxSmi (i.e. n <g 1) and 
that we need not consider the second-order dependence of 
A on position in the source plane. The purpose here is to 
illustrate how the DME affects the line profile, not how much 
(this is controlled by n). If the DME ~0.01 Vf sin*, then the 
residuals would be ~1% of the line profile. 

The image overall is not magnified for any (small) n. We 
adjust n until the line centroid shifts by the correct amount, 
to allow comparison with the MCE. 

The DME causes one side of the galaxy to be magnified 
more than the other. Thus, the residuals due to it are of the 
same sign for each half of the galaxy (e.g. for v r >0). There 
is no change in sign at the horns. 

These correspond to material displaced from the centre 
of the galaxy along the direction OP in Figure [2] As argued 
previously, we only need to consider the component of Vd 
along this direction. Thus, the effects of differential magnifi¬ 
cation are substantial for the material corresponding to the 
horns (in so far as the DME affects the image at all). This is 
in contrast to the MCE, which hardly affects the line profile 
at these positions (because the gradient of the line profile 
there is 0). 

For some v r , the MCE leads to very large residuals 
if ^ ^ in . is low (Figure JtJ) . Thus, observing faster-rotating 
galaxies might make it easier to distinguish between the 
DME and the MCE (as ——r would likely be smaller). 
However, the DME would be larger and so it would have 
to be accounted for more accurately. 

Detailed profiles of individual spectral lines may there¬ 
fore help in determining the balance between the MCE and 
the DME in accounting for redshift differences between mul¬ 
tiple images. In reality, a large number of spectral lines 
would probably need to be stacked. Even then, it seems 
likely that, in so far as redshift differences between the im¬ 
ages are discernible, the cause of such differences can also 
be determined. 


5.2 The Second Order Effect 

For simplicity, we continue assuming the source is located 
along OP. Thus, regions with high |tv| are magnified more 
and regions with lower | v r | are magnified less than the centre 
of the source due to the second order dependence of A on 
position. To investigate what this means for spectral line 
profiles, we set A to depend quadratically on position along 
the direction OP. This introduces a sin 2 (j> dependence. 

A= l + nU sinV 

1 + 3n v ’ 

A quadratic dependence along the orthogonal direction 
would give a cos 2 4> dependence. Because cos 2 (j> = 1 — sin 2 efr, 
a second derivative of A in either direction would affect the 
line profile in the same way (i.e. the residuals would have the 
same pattern, up to sign); once any overall magnification was 
corrected for. 

When comparing the spectra, observers would first scale 



Figure 8. The pattern of residuals for the second-order DME 
(Equation |44| and k = 1. A control profile obtained using Equa- 
tion !42l was subtracted and the result convolved with a Gaussian. 
The residuals are symmetric about v r = 0, so both horns become 
more pronounced in this example. 


them to have equal intensities. Thus, we must avoid chang¬ 
ing the total intensity. This leads to the factor of 1 + 3n in 
the denominator of Equation |44| 

The results are shown in Figure [8] The effect is sym¬ 
metric in v r , so both horns are equally affected. These end 
up more pronounced in the secondary image than in the 
primary (in this example). 

In reality, both the first and second order DME would 
be present for any given pair of images of the same object. 
Thus, the residuals would have both an antisymmetric and 
a symmetric part. However, the latter would likely be very 
small for cluster mass lenses (except for caustic images). 


5.2.1 Non-rotating sources 

We briefly consider how the DME might affect a non¬ 
rotating pressure-supported galaxy, such as an elliptical. 
If a galaxy is symmetric such that p(r) = p{—r) and 
cr(r) = a(—r), then at first order the DME does not affect 
an unresolved image at all. To see this, consider an inversion 
mapping r —> —r while leaving V4 unchanged. The situa¬ 
tion is identical to reversing the direction of VA instead, so 
one expects the DME to act in exactly the opposite way on 
the spectrum. But the situation has not physically changed, 
so the DME must also remain unchanged. 

This conclusion breaks down at second order. Suppose 
parts of the galaxy further from its centre are magnified 
more. Then, as the velocity dispersion generally decreases 
outwards, the derived velocity dispersion of the image will 
be reduced by the DME. The effect is larger for the fainter 
(secondary) image, which will thus appear to have a smaller 
velocity dispersion than the primary in this example. 

This is likely to be more important for galaxy-galaxy 
lensing as 8e is smaller, making du over the source larger. 
In this case, the DME might be useful to constrain the form 
of <r(r) using a double image of a distant elliptical galaxy. 

Alternatively, if the source galaxy was well-understood, 
one might be able to constrain Vi and thus have a better 
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understanding of the lens. Doing both simultaneously would 
likely be very challenging and model-dependent. 


accuracy required in the redshift measurements and the cos¬ 
mological distance to the source, this is perhaps not the best 
option at present. 


6 TARGETS WITH A SMALLER EFFECT 

Figure [4] shows that the Differential Magnification Effect 
may well need to be accounted for when using the redshifts of 
double images to determine the motion of the lens. However, 
doing this accurately may be difficult because of the cosmo¬ 
logical distance to the source galaxy. Therefore, we suggest 
sources for which the DME should be smaller, allowing us 
to correct for it less accurately. 

Some strategies outlined here involve selecting targets 
which are harder to observe, thereby making their spectra 
less accurate. It is up to observers to decide which targets 
best minimise the uncertainty introduced by the DME while 
still being feasible to obtain accurate spectra for. We also 
note that minimising the uncertainty introduced by correct¬ 
ing for the DME is not necessarily equivalent to minimising 
the magnitude of the DME, because there may be sources 
for which the DME can be estimated more reliably. 


6.1 QSOs 


The DME oc rv, where the source has typical size r and 
radial velocity spread v. For a given mass M, the Virial The¬ 
orem gives v oc —Thus, the DME oc yjr. For sources with 
a particular M, the DME would be reduced if the source 
were smaller, even though it would spin faster. 

One obvious type of very small target visible over cos¬ 
mological distances is a quasi-stellar object (QSO). If a dou¬ 
bly imaged QSO could be found lensed by the Bullet Cluster, 
it might make an excellent target. 

QSO spectra can sometimes lack distinctive features 
which are required for precise redshift measurements. The 
Ly-a forest might provide a solution, but only if the same 
feature appeared in both images. Because the rays of light 
corresponding to the images diverge from the source]]] this is 
only feasible if the gas cloud causing the absorption feature 
was located fairly close to the QSO. 

Another problem might be that the small size of QSOs 
makes their radiation time-variable. Thus, the time delay 
between the images could make it difficult to compare their 
spectra. This might require observers to wait out the time 
delay, which would first have to be determined (though it 
could be estimated, perhaps using Equation 151. 


6.2 Smaller &: Fainter Galaxies 

The DME is proportional to both the rotation velocity and 
the size of the source galaxy. Brighter galaxies generally 
rotate faster (Tully & Fisher 1977), so targeting fainter 
galaxies might help. One advantage of this approach is that 
the number density of fainter galaxies is greater than for 
brighter ones (Schechter 1976). This makes it more likely 
that a suitably oriented multiple image can be found. 

However, it would be harder to obtain accurate spectra 
— and thus redshifts — for fainter targets. Given the high 


1 by an angle ( 6 >i - 0 2 ) 


6.3 Elliptical Galaxies 

Elliptical galaxies might make good targets as they usually 
rotate slower than spirals, if at all. They might be distin¬ 
guished using colour or image shape (though one might need 
to correct for distortion by the lens). The surface bright¬ 
ness declines outwards much more gradually for ellipticals 
than for spirals, potentially providing another way of finding 
them. 

Before conducting detailed observations, targets se¬ 
lected like this might be followed up to check if the spec¬ 
tral line profiles were double-horned (characteristic of ro¬ 
tation along the line of sight). A good target should have 
a Gaussian-looking line profile, characteristic of a pressure- 
supported object. 

However, even ellipticals can rotate, so the DME might 
not be eliminated by observing one. Also, most galaxies are 
not elliptical, so finding a bright doubly-imaged one is some¬ 
what dependent on luck. Nonetheless, we consider this the 
. This is partly because the work of |Gonzalez| 
) identified a multiply imaged galaxy which may 
be a good target for determining the MCE. 


best option 


et al. (2009 


6.4 Galaxy Orientation &; Viewing Angle 

Figure [2] shows the geometry of the situation. The radial 
velocity of any part of the galaxy is scaled by sini, so a 
face-on spiral could not have a redshift gradient across it 
and thus would be unaffected by the DME. 

Determining i requires an image of the source galaxy 
to determine its shape. One could imagine trying to select 
targets which look round. Even then, the target might be an 
elliptical galaxy with some rotation along the line of sight. 

The direction of VA is also very important. In the¬ 
ory, we should seek situations where VA is orthogonal to 
the major axis of the image. In such situations, an edge- 
on disk galaxy would appear as a line on the sky aligned 
orthogonally to the direction towards the lens. With more 
complicated lenses, the galaxy would appear as a line on the 
sky orthogonal to VA, which hopefully could be estimated 
using a lensing reconstruction. 


6.5 Galaxy Position & Caustics 


As was already pointed out in Molnar et al. (2013a I, the 


MCE is maximal for image separations aligned with the di¬ 
rection of the lens’ proper motion. As the collision is nearly 
in the plane of the sky, the likely direction of this motion is 
plain to see & observers should target double images sepa¬ 
rated approximately along this direction. 

In the simple lens model that we use, the DME oc 
, 1 while the MCE oc \fv? + 4. Thus, galaxies less 

V i* 2 +4 

closely aligned with the lens make better targets in terms 
of the systematic error of the DME. For such sources, the 
images are more widely separated. 

Unfortunately, sources with larger u make worse targets 
under a number of other considerations. Both images — but 
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especially the secondary — are fainter. This image also be¬ 
comes very close to the lens, making it more likely to be 
obscured. 

A lensing reconstruction could be used to suggest par¬ 
ticular locations where the magnification is nearly constant. 
A galaxy with double images near such locations might make 
a good target for measuring the MCE. The difficulty with 
this is that such ‘sweet spots’ might be small and not have 
any observable galaxies in them. Also, a magnification map 
of sufficient accuracy might be difficult to obtain. 

Regions where A varies rapidly with position enhance 
the DME. Caustics occur where the magnification of a small 
part of the source plane is infinity. This means that the mag¬ 
nification varies rapidly with position in the source plane, 
greatly increasing the DME. For this reason, it has been 
suggested to avoid caustic images (M olnar et al.| ;2013a). 

However, it may be worthwhile to try correcting for 
the DME in caustic images because they are generally very 
bright, making for more accurate spectra. The correction 
would need to be done very accurately in this case, because 
the MCE might be much smaller than the DME. Because 
this is likely to lead to controversy surrounding the measure¬ 
ments, we also recommend avoiding caustic images unless 
the observational case is compelling. 


6.6 Substructure Within The Source Galaxy 

If the source galaxy has e.g. a bright star-forming region 
which emits strongly in the UV while the rest of the galaxy 
does not, then another possibility arises. UV spectral lines 
would correspond to material in a small part of the galaxy. 
Consequently, different parts of it would have much the same 
radial velocity and the magnification across it would be more 
uniform than across the whole galaxy. This would reduce the 
DME for the UV lines. 

Thus, in this example, the redshifts for the images 
should be calculated using only the UV lines. In practice, one 
would exploit the fact that a small part of the galaxy should 
have only a narrow range of redshifts. Thus, one might use 
only spectral lines which have a similar redshift. If the intrin¬ 
sic linewidth was small, then the line should be very narrow 
as there would not be much rotational broadening. 

Another possibility is using spectral lines that are more 
prominent in the bulge of the galaxy (if it has one). The 
bulge is mostly pressure-supported with little rotation and 
is also much smaller than the whole galaxy. In this case, 
the spectral lines to use might be quite broad, but have a 
Gaussian line profile even if the galaxy is rotating (so most 
spectral lines have a double-horned profile). 

For this technique to be of much benefit, the galaxy 
needs to be quite inhomogeneous in some way. It might be 
difficult to tell whether it is from an image. Also, the tech¬ 
nique reduces the number of spectral lines that are used to 
calculate the redshift, making for less precise measurements. 
This makes it difficult to target fainter galaxies, perhaps 
forcing observers to choose between observing all of a fainter 
galaxy or (effectively) part of a brighter one. 

Any decision to restrict which spectral lines are used 
to determine the MCE should be justified based on more 
detailed observations of nearby galaxies. This increases con¬ 
fidence that the decision does indeed effectively restrict the 
observations to a small part of the source. 


Table 2. Input parameters used for the ALMA exposure time 
calculator, available at: 

https: //almascience.eso.org/proposing/sensitivity-calculator 

The dual polarisation mode should be used as polarisation 
is unimportant here. The angular resolution does not affect the 
result, which was 6.17 hours. 


Parameter 

Value 

Declination 

-56° 

Frequency 

150 GHz 

Bandwidth per polarisation 

100 m/s 

Water vapour column density 

Default: 5 th octile (1.796 mm) 

Number of antennae 

50 X 12 metre 

RMS sensitivity 

1.5 mjy 


Even if all usable spectral lines are used to measure the 
MCE, it is still likely that some lines are less affected by 
the DME than others. It may be important to allow for this 
in the analysis e.g. by grouping spectral lines based on their 
linewidth and shape and obtaining an inference on the MCE 
for each group. 

Due to the uncertainties introduced by such procedures, 
we recommend reducing the DME by careful choice of target 
so that the exact method used to correct for it does not much 
affect the inferred lens velocity. 


7 OBSERVATIONAL PROSPECTS 


We now consider the technical feasibility of detecting the 
MCE with high resolution spectroscopic measurements. The 


target we consider is presented in Gonzalez et al. (2009). 


This has a flux of ~100 mjy at mm wavelengths. Due to 
dust in the source galaxy, it is best to do the observations 
at such wavelengths. For this purpose, we consider using the 
Atacama Large Millimetre Array (ALMA). The Bullet Clus¬ 
ter rises to within ~35° of the zenith at this site (minimum 
airmass « 1.2). 

The parameters we supplied are given in Table [2] To 
clarify the tension with A CDM, it would be necessary to 
constrain the collision speed to within ~250 km/s (repre¬ 
senting an 8% accuracy if the actual speed is 3000 km/s). 
This corresponds to determining the redshift difference be¬ 
tween the images to 0.1 km/s. 

A flux accuracy of 1.5 mjy corresponds to ~2% ac¬ 
curacy near the peak of the spectral energy distribution. 
The online calculator suggests that this level of precision 
can be attained in just over 6 hours under typical weather 
conditions. Thus, we believe that a night with all 50 of the 
12 metre dishes might allow us to constrain, in this case, the 
proper motion of the main cluster. 

In principle, the sub-cluster’s motion can also be deter¬ 
mined using the MCE. However, we could not find known 
multiple images with separation close to the East-West di¬ 
rection, the likely direction of the collision. Thus, suitable 
multiple images would first need to be found around the sub¬ 
cluster. This might be accomplished using a fairly deep ex¬ 
posure with the Atacama Pathfinder Experiment (APEX) 
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or other telescopes. If suitable targets were found, they could 
be targeted for detailed spectroscopic follow-up. 

The actual direction of motion of each component of 
the Bullet Cluster is not known for certain. Thus, observers 
should target multiple images separated in roughly orthog¬ 
onal directions. Observing more than one object can also 
minimise systematics associated with details of the source 
and the lens. 

Once suitable targets are found, we believe that a few 
nights of observations should be sufficient. The field of view 
might be large enough that multiple images of different 
sources can be observed in the same pointing, reducing the 
required telescope time further. Observing the images si¬ 
multaneously can also reduce systematics associated with 
changing atmospheric conditions. 

The main difficulty would be in achieving a very accu¬ 
rate calibration of the spectra. However, it is the relative 
redshift between multiple images that is critical for deter¬ 
mining the MCE. Absolute redshifts are not needed very 
precisely. 


8 CONCLUSIONS 


The Moving Cluster Effect (MCE) may provide an essen¬ 
tially direct method to determine the tangential motion of 
high-z lensing clusters such as the Bullet Cluster, thereby 
clarifying the tension that appears to exist with A CDM 
(Molnar et al .|2013a I. This requires a precise determination 
of redshift differences between multiple images of the same 
object. 

We expect the MCE to cause multiple images created 
by the Bullet Cluster to have a redshift velocity difference 
of ~1 km/s. We find that, for multiple images of a realistic 
target, this level of accuracy should be feasible with a night 
on ALMA, using all 50 of its 12 metre dishes. To determine 
the motions of both the main and the sub-cluster, multiple 
pointings may be required. 

We considered the effect of the time delay between mul¬ 
tiple images. In an expanding Universe, this causes them to 
have different redshifts (the Differential Expansion Effect, 
DEE). However, this effect is second order in the deflection 
angle, whereas the MCE is first order. Thus, the DEE can 
be neglected compared with the MCE. 

The Differential Magnification Effect (DME) arises 
when observing an object with a redshift gradient across 
it, most likely due to rotation. The precise way in which 
the magnification varies across the source is different for 
different images. This leads to them having different mean 
redshifts. Under plausible circumstances, the effect is large 
enough that it must be considered when trying to infer the 
lens motion (Figure |f|. 

We consider various methods for determining how the 
DME affects image redshifts. All techniques require the pro¬ 
files of spectral lines, if only to estimate the redshift gradient 
across the image based on the linewidth. If the line profile 
could be observed in more detail, then one could exploit 
the fact that the DME and MCE affect the line profile in 
different ways (Figure [Tjl. 

Otherwise, the DME could be estimated by determin¬ 
ing the parameters which control it (disk scale length & 
orientation, maximum line of sight rotation speed & sense 


of rotation, how magnification varies with position in the 
source plane for each image and, to a smaller extent, the 
source surface density). 

The DME is smaller for some sources than for others. 
We discuss which types of source might reduce the DME 
in Section [6] We believe the best option is to use multiple 
images of an elliptical galaxy as these are likely to rotate 
slower than spirals, if at all. In particular, the triple image 
identified in Gonzalez et al. (2009) might be a good source 
to observe. 

The DME is larger for lower mass lenses, making it 
more important for galaxy-galaxy lensing. Measuring pecu¬ 
liar velocities of galaxies using the MCE might thus be very 
challenging, especially as these are likely smaller than for 
the Bullet Cluster. 

However, the DME might be easier to observe. This 
might give more information about the gravitational poten¬ 
tial of the lensing galaxy and perhaps the redshift structure 
of the source. We speculate that the DME might provide a 
way to estimate the radial gradient in the velocity dispersion 
of a distant lensed elliptical galaxy. 

The authors wish to thank the referee for several very 
useful comments. Thanks also to Keith Horne and Douglas 
Buisson for helpful discussions. IB is supported by a Science 
and Technology Facilities Council studentship. 
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